Towards Data Mining Without Information on Knowledge Structure

Authors

  • Alexandre Vautier
  • Marie-Odile Cordier
  • Rene Quiniou
Abstract

Most knowledge discovery processes are biased, since part of the knowledge structure must be given before extraction. We propose a framework that avoids this bias by supporting all major model structures (e.g. clustering and sequences), as well as specifications of data and DM (Data Mining) algorithms, in the same language. A unification operation is provided to automatically match the data to the relevant DM algorithms in order to extract models and their related structure. The MDL principle is used to evaluate and rank models. This evaluation is based on the covering relation that links the data to the models. The notion of schema, related to category theory, is the key concept of our approach. Intuitively, a schema is an algebraic specification enhanced by the union of types and by the concepts of list and relation. An example based on network alarm mining illustrates the process.
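As a rough illustration of how an MDL-style ranking can work, the sketch below scores two candidate models of a symbol sequence by a two-part code length (bits to describe the model plus bits to describe the data given the model) and ranks them, shortest first. This is not the authors' schema-based framework: the function names, the two candidate models, the parameter cost, and the alarm labels are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def uniform_model_bits(data, alphabet_size):
    """Two-part code length when every symbol is assumed equally likely."""
    # Assumption: the alphabet size is known to the receiver, so the model is free.
    model_bits = 0.0
    data_bits = len(data) * math.log2(alphabet_size)
    return model_bits + data_bits


def frequency_model_bits(data, alphabet_size):
    """Two-part code length when symbols are coded by their empirical frequency."""
    n = len(data)
    counts = Counter(data)
    # Assumption: the model is sent as one integer count in [0, n] per symbol.
    model_bits = alphabet_size * math.log2(n + 1)
    # Ideal (Shannon) code length of the data under the empirical distribution.
    data_bits = -sum(c * math.log2(c / n) for c in counts.values())
    return model_bits + data_bits


def rank_models(data, alphabet_size):
    """Return (model name, total bits) pairs, shortest description first."""
    scores = {
        "uniform": uniform_model_bits(data, alphabet_size),
        "frequency": frequency_model_bits(data, alphabet_size),
    }
    return sorted(scores.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical alarm log: one alarm type dominates.
    alarms = ["link_down"] * 90 + ["cpu_high"] * 10
    for name, bits in rank_models(alarms, alphabet_size=2):
        print(f"{name}: {bits:.1f} bits")
```

On a skewed log like the hypothetical one above, the frequency model pays for its parameters but still yields the shorter total description; on a balanced log the parameter cost is not repaid and the uniform model ranks first.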

Similar resources

The Impact of Organizational Factors on the Effectiveness of Knowledge Management among Nurses

Background and Objectives: Knowledge Management (KM) has emerged as a pathway towards competitive advantage in the current complex industrial environment. The aim of the present study was to explore the relationship between KM effectiveness and various organizational factors including social interactions (trust, communication and coordination), infrastructure factors (structure, information technol...

Spam, Opinions, and Other Relationships: Towards a Comprehensive View of the Web Knowledge Discovery

“Web mining” or “Web Knowledge Discovery” is the analysis of Web resources with data-mining techniques such as classification, clustering, association-rule or graph-structure methods. Its applications pervade much of the software Web users interact with on a daily basis: search engines’ indexing and ranking choices, recommender systems’ recommendations, targeted advertising, and many others. An ...

Applying Data Mining in Healthcare: An Info-Structure for Delivering 'Data-Driven' Strategic Services

Presently, there is a growing demand from the healthcare community to leverage upon and transform the vast quantities of healthcare data into value-added, 'decision-quality' knowledge, vis-à-vis, strategic knowledge services oriented towards healthcare management and planning. To meet this end, we present a Strategic Knowledge Services Info-structure that leverages on existing healthcare knowle...

Perform Three Data Mining Tasks with Crowdsourcing Process

For data mining studies, because of the complexity of performing feature selection by hand, we need to send some of the labeling to workers through crowdsourcing activities. The process of outsourcing data mining tasks to users is often handled by software systems without enough knowledge of the age or geography of the users' residence. Uncertainty about the performance of virtual user...

Mining Web Documents for Unintended Information Revelation

This research concerns web site information security. With an increasing number of documents being generated by different individuals and departments in organizations, there is a potential of releasing information which is inconsistent with the overall goals, objectives and operation of the organization. We refer to this as unintended information revelation (UIR). This paper focuses on progress...

Publication date: 2007